Computational identification of transcription factor binding sites by functional analysis of set of genes sharing overrepresented upstream motifs in yeast S.cerevisiae
نویسندگان
چکیده
Transcriptional regulation is a key mechanism in the functioning of the cell, and is mostly effected through transcription factors binding to specific recognition motifs located upstream of the coding region of the regulated gene. The computational identification of such motifs is made easier by the fact that they often appear several times in the upstream region of the regulated genes. In this poster we present a computational method to construct sets of genes characterized by the statistical overrepresentation of a certain motif in their upstream region, and a method to analyze the functional characterization of these sets by analyzing their annotation to Gene Ontology terms. For the sets showing a statistically significant specific functional characterization, we conjecture that the upstream motif characterizing the set is a binding site for a transcription factor involved in the regulation of the genes in the set. Many known binding sites are identified, and a few new candidates..
منابع مشابه
Identification of human transcription factor binding sites by comparative genomics
Understanding transcriptional regulation of gene expression is one of the greatest challenges of modern molecular biology. A central role in this mechanism is played by transcription factors (TF) which typically bind to specific, short DNA sequence motifs which are usually located in the upstream region of the regulated genes. We discuss here a simple and powerful approach for the identificatio...
متن کاملMethod for identifying transcription factor binding sites in yeast
MOTIVATION Identifying transcription factor binding sites (TFBSs) is helpful for understanding the mechanism of transcriptional regulation. The abundance and the diversity of genomic data provide an excellent opportunity for identifying TFBSs. Developing methods to integrate various types of data has become a major trend in this pursuit. RESULTS We develop a TFBS identification method, TFBSfi...
متن کاملA computational approach to regulatory element discovery in eukaryotes
Gene regulation in Eukaryotes is mainly effected through transcription factors binding to rather short recognition motifs generally located upstream of the coding region. We present a novel computational method to identify regulatory elements in the upstream region of Eukaryotic genes. The genes are grouped in sets sharing an overrepresented short motif in their upstream sequence. For each set,...
متن کاملComputational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae.
AlignACE is a Gibbs sampling algorithm for identifying motifs that are over-represented in a set of DNA sequences. When used to search upstream of apparently coregulated genes, AlignACE finds motifs that often correspond to the DNA binding preferences of transcription factors. We previously used AlignACE to analyze whole genome mRNA expression data. Here, we present a more detailed study of its...
متن کاملDiscovering Transcription Factor Binding Motif Sequences
Introduction In biology, sequence motifs are short sequence patterns, usually with fixed lengths, that represent many features of DNA, RNA, and protein molecules. Sequence motifs can represent transcription factor binding sites for DNA, splice junctions for RNA, and binding domains for proteins. Thus, discovering sequence motifs can lead to a better understanding of transcriptional regulation, ...
متن کامل